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Objective: To validate an automated cerebellar segmentation method based on active shape and appearance 
modeling and then segment the cerebellum on images acquired from adolescents with histories of prenatal alco- 
hol exposure (PAE) and non-exposed controls (NC). 

Methods: Automated segmentations of the total cerebellum, right and left cerebellar hemispheres, and three 
vermal lobes (anterior, lobules I-V; superior posterior, lobules VI-VII; inferior posterior, lobules VIII-X) were 
compared to expert manual labelings on 20 subjects, studied twice, that were not used for model training. The 
method was also used to segment the cerebellum on 1 1 PAE and 9 NC adolescents. 

Results: The test-retest intraclass correlation coefficients (ICCs) of the automated method were greater than 0.94 
for all cerebellar volume and mid-sagittal vermal area measures, comparable or better than the test-retest ICCs 
for manual measurement (all ICCs > 0.92). The ICCs computed on all four cerebellar measurements (manual and 
automated measures on the repeat scans) to compare comparability were above 0.97 for non- vermis parcels, and 
above 0.89 for vermis parcels. When applied to patients, the automated method detected smaller cerebellar vol- 
umes and mid-sagittal areas in the PAE group compared to controls (p < 0.05 for all regions except the superior 
posterior lobe, consistent with prior studies). 

Discussion: These results demonstrate excellent reliability and validity of automated cerebellar volume and mid- 
sagittal area measurements, compared to manual measurements. These data also illustrate that this new technol- 
ogy for automatically delineating the cerebellum leads to conclusions regarding the effects of prenatal alcohol 
exposure on the cerebellum consistent with prior studies that used labor intensive manual delineation, even 
with a very small sample. 

© 2014 The Authors. Published by Elsevier Inc. All rights reserved. 



1. Introduction 

The cerebellum is involved in postural control, gait, balance, and co- 
ordination of bilateral movements. Recent work has demonstrated that 
the neocerebellum, comprising of the lateral hemispheres, vermal lob- 
ules VI-VII, and the dentate nucleus, is involved in cognitive tasks, 
such as executive function (Berman et al., 1995; Raichle et al., 1994; 
Schlosser et al, 1998), verbal fluency (Appollonio et al, 1993), verb gen- 
eration (Fiez et al, 1996), working memory (Fiez et al, 1996), and 
source memory (Tamagni et al, 2010). Moreover, the cerebellum is of 
special interest to the study of brain development since it has been 
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shown to have greater vulnerability to damage in the perinatal period 
(Tarn et al, 201 1 ). Despite the clear evidence that the cerebellum un- 
derlies cognitive and motor functions and is vulnerable during 
neurodevelopment, very few human neuroimaging studies have mea- 
sured cerebellar volumes in part due to the paucity and limitations of 
automated cerebellar segmentation algorithms. Instead, imaging stud- 
ies of the human cerebellum have relied on manual segmentation to de- 
lineate the cerebellar hemispheres and vermis. 

Manual segmentation has problems when used in large scale imag- 
ing studies, as it is slow, and error-prone, and consistency between op- 
erators is difficult to obtain and maintain. A few automatic or semi- 
automatic cerebellar segmentation algorithms have been published 
(Cerasa et al, 2009; Hwang et al, 2011; Powell et al, 2008; van der 
Lijn et al, 2012; Weier et al, 2012; Zhao et al, 2010), but these methods 
(including Freesurfer) only provide a delineation of the total cerebellum 
or cerebellar hemispheres (and not the cerebellar vermis). Given that 
the cerebellar vermis has been repeatedly implicated in diseases such 
as alcoholism (Andersen, 2004; Baker et al, 1999; Cavanagh et al., 
1997), fetal alcohol spectrum disorders (Archibald et al, 2001; Maier 
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et al, 1999; O'Hare et al, 2005; Ramadoss et al, 2007; Sowell et al, 
1996), and autism (Brambilla et al, 2003; Scott et al, 2009; Webb 
et al, 2009), these existing cerebellar segmentation methods that do 
not delineate the vermis are highly limited for clinical research. 

Many techniques exist to divide a medical image into regions with 
similar properties such as gray level, color, texture, brightness and con- 
trast. Medical image segmentation based on texture features has been 
explored, but has never gained popularity (Haralick et al, 1973). Early 
techniques based on gray level features such as histogram thresholding, 
edge based segmentation, and region based segmentation generally do 
not perform well on images with complex anatomy such as the brain 
(Sharma and Aggarwal, 2010). Segmentation of the cerebellum is espe- 
cially challenging, because the cerebellar foliations are <0.5 mm, lead- 
ing to extensive partial voluming within the cerebellum at the 
standard 1 mm 3 resolution of Tl -weighted images. Therefore, any 
segmentation method that seeks to delineate more than the outer cere- 
bellar boundary must use prior knowledge. The state of the art and most 
popular methods for medical image segmentation are atlas based 
approaches (e.g., Freesurfer) and model based approaches (e.g., deform- 
able models, active shape and appearance modeling), both of which use 
prior knowledge of the structure to be segmented. Atlas based segmen- 
tation relies on appropriate atlas formation and selection, and accurate 
registration of the image to be segmented to the atlas. At least two 
atlases of normal adult human brain anatomy include delineation of 
the cerebellar vermis and lobes (Diedrichsen et al., 2009; Rohlfing 
et al, 2010), and may be used to parcellate the cerebellum. In our expe- 
rience, atlas-based methods do not provide reliable estimation of cere- 
bellar volumes or mid-sagittal areas (test-retest reliabilities range 
from 0.15 to 0.70, work in progress) and are better suited for integration 
with functional imaging methods to identify anatomical regions where 
functional activation occurs, which is the use recommended by the de- 
velopers of these atlases. Therefore, we have developed a Bayesian im- 
plementation of active appearance modeling for measurement of 
cerebellum volume. In this model the structure of the cerebellum and 
its subparcels is assumed to have a repetitive form of geometry that 
can be modeled probabilistically from hand-delineated examples. The 
probabilistic model of cerebellar geometry can be used to enforce a 
strong prior during segmentation, where candidate segmentations 
matching the model are more likely to be chosen as the correct 
parcellation. 

Accurate automated cerebellar segmentation would be valuable in 
the study of clinical samples, such as fetal alcohol spectrum disorders 
(FASD). Although not all children with histories of prenatal exposure 
to alcohol meet the diagnostic criteria for fetal alcohol syndrome 
(FAS) (Kuehn et al., 2012), many with heavy exposure exhibit function- 
al abnormalities. Children with FASD show difficulties with cognitive 
processes that may be related to alcohol-induced cerebellar dysfunc- 
tion, including memory, verbal learning, and executive functions 
(Mattson et al., 201 1, 2013; Vaurio et al, 201 1 ). Alcohol's negative im- 
pact on cerebellar development is also likely to contribute to deficits 
in motor behaviors observed in children, adolescents and adults prena- 
tally exposed to alcohol, such as longer reaction times, slower move- 
ment, problems generating isometric and isotonic forces, bimanual 
coordination, and postural balance (Connor et al., 2006; Debaere et al., 
2004; Nguyen et al., 2013a, b; Roebuck et al., 1998a, b; Simmons et al., 
2010, 2012). Moreover, neuropathology documented in animal and 
clinical research has repeatedly shown that the cerebellum is particular- 
ly sensitive to early exposure to alcohol (Bauer-Moffett and Altman, 
1975; Goodlett et al, 1990a, b; Hamre and West, 1993; Phillips and 
Cragg, 1982; Thomas et al, 1998), specifically the cerebellar vermis 
(Clarren et al, 1978; Goodlett et al, 1990a; Jones et al, 1973; Peiffer 
et al., 1979; Wisniewski et al., 1983). MRI studies of children prenatally 
exposed to alcohol report reduced volume of the cerebellar hemi- 
spheres and vermis (Autti-Ramo et al, 2002; Mattson et al., 1992, 
1994). Previous quantitative cerebellar imaging studies in FASD have 
used manual tracings to detect smaller cerebellar mid-sagittal vermis 



area (O'Hare et al., 2005; Sowell et al., 1996) or total cerebellar volumes 
(Bookstein et al., 2006). 

In this paper, we use Tl -weighted images to automatically segment 
the cerebellum using a novel algorithm under continuing development 
by Neurobehavioral Research, Inc. (NRI) under an SBIR Phase I grant. We 
demonstrate that our automated cerebellar segmentation has high va- 
lidity when compared to gold-standard manual delineation obtained 
from Neuromorphometrics, Inc., a small business originating from 
the Center for Morphometric Analysis at the Massachusetts General 
Hospital, whose manual delineations have been used to validate accept- 
ed image processing tools such as Freesurfer and FSL In addition, we 
demonstrate the clinical validity of our algorithm by using it to segment 
the cerebellum on a sample of adolescents prenatally exposed to alcohol 
(PAE) and age-matched controls. Based on previous studies that manu- 
ally delineated the cerebellum, we expected that PAE children would 
show smaller total cerebellar volumes and smaller mid-sagittal vermal 
areas of the anterior lobe (vermis lobules I-V) and posterior inferior 
lobe (vermis lobules VIII-X). 

2. Methods 

2.1. Cerebellar Analysis Toolkit (CATK) overview 

CATK functions as a fully-automated Tl MRI cerebellum delineation 
and parcellation tool. An active profile-appearance modeling (AAM; 
Patenaude, 2007; Cootes, 2000; Cootes et al., 2001) framework is used 
and the toolkit is currently capable of parcellating the cerebellum into 
five structures: left and right hemispheres, and three vermal lobes (I-V, 
VI-VII, and VIII-X), although further development is planned to extend 
the algorithm to segmentation of the left and right cerebellar nuclei. The 
CATK is based on statistical shape and appearance models learned from 
a training set of examples hand-delineated by Neuromorphometrics, 
Inc. The advantage of this is that strong prior knowledge about the cere- 
bellum inherent in the data (such as the overall shape of the cerebellar 
vermal lobes) is taken into account during segmentation, resulting in a 
segmentation method that enforces smoothness according to probable 
variations specific to the structures. Fig. 1 shows a broad overview 
of the components of the algorithm. The basic components are: data 
preparation (e.g., intensity normalization, pose correction, cerebellum 
localization), statistical model generation, and model fitting (for new 
samples). 

2.1.1. CATK description 

To create the statistical shape model, the training images are first lin- 
early aligned using a 9 degrees of freedom model (rotation, translation, 
and scale) (Woods et al., 1998a,b) with the ICBM-152 template. Shapes 
are parameterized by a point distribution model (Cootes, 2000; Cootes 
et al, 2001 ) derived from stellar meshes. Once training images are 
aligned, projecting rays from a central point to a label boundary implic- 
itly provides vertex correspondence. For subparcels (like the hemi- 
spheres and vermal lobes), multiple projection centers are used to 
maximize correspondence with the ground truth labels while allowing 
the covariance between related surfaces to be modeled. For example, 
the three lobes of the vermis are modeled by a single statistical model 
containing three surfaces. While the spherical assumption limits the to- 
pology of the mesh, modeling errors are minimal for the designated 
structures, and a number of computational benefits are provided. For in- 
stance, the connectivity of the points is implicit which allows fast con- 
version to triangular mesh surfaces, and models can be easily re- 
sampled at multiple resolutions which enables both coarse and refined 
fitting. The ability to control the shape dimension in this way is a signif- 
icant advantage for statistical modeling as it impacts directly on the 
number of examples required to obtain good shape parameterization. 
The point distribution model projects each high-dimensional shape vec- 
tor (i.e. the raw vertices) into a low-dimensional eigenspace, having at 
maximum as many principal modes as examples in the training set. 
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Fig. 1. Overview of CATK (Cerebellar Automated Toolkit). 



Similarly, a statistical intensity model is generated by sampling 1-D pro- 
files normal to the surface at each vertex. 

Using this statistical model, new images are parcellated by using the 
template alignment (rotation, translation, and scale) to initialize the po- 
sition of the average shape, and then applying a three-stage iterative 
procedure: The first stage refines the initial position while keeping the 
shape parameters fixed, after which the second stage adjusts the verti- 
ces by maximizing the correlation between observed and reference in- 
tensity profiles subject to the closest set of shape parameters. This 
allows the surface to adapt to new examples while ensuring that they 
exist within the subspace of possible shapes. Finally, a refinement 
stage uses conjugate gradient optimization to maximize the conditional 
intensity-shape distribution using a Bayesian formulation. Owing to its 
poor lateral boundary distinction, an additional refinement step is ap- 
plied to central vertices of the vermis in order to improve the accuracy 
of sagittal area measurements. 

2 A 2. CATK training and validation 

Expert manual cerebellar labels were obtained from Neuro- 
morphometrics, Inc. (http://www.neuromorphometrics.com) on 63 
unique subjects, 20 of whom were imaged and labeled in two separate 
imaging studies. The 43 subjects without repeat data were used 
for model training, and the 20 subjects studied twice were used 
for validation of the model. The average age of the 43 subjects used 
for training was 44 ± 30 years (range 5-96 years), with 13 children 
(age 5-17 years), 19 adults (age 18-70 years), and 11 elderly (age 
71-96 years). There were 21 males and 22 females in the training set, 
and only one training subject was left-handed. The average age of the 
20 subjects used for validation was 23 ± 4 years (range 19-34 years). 
There were 8 men and 1 2 women in the validation set, and all validation 
subjects were right-handed. Neuromorphometrics uses protocols for 
defining neuroanatomical regions and custom software that were de- 
veloped originally as part of the Center for Morphometric Analysis at 
the Massachusetts General Hospital (Harvard Medical School), and all 
labels were verified by a neuroanatomy expert. Neuromorphometrics 



provided labeling for the total cerebellum, left and right cerebellar hemi- 
spheres and nuclei, and three cerebellar vermis lobes (lobules I-V, VI-VII, 
andVffl-X). 

2.2. Clinical application in FASD 

Two groups of children between the ages of 1 0 and 1 8 were included 
in this study: 13 children with histories of heavy prenatal alcohol expo- 
sure (PAE group) and 9 non-exposed control children (NC group). 
Children were drawn from a larger study of the behavioral teratogenic- 
ity of alcohol and were recruited via several mechanisms including pro- 
fessional or self-referral and community outreach. Inclusion in this 
study required having English as the primary language, no history 
of head trauma, serious medical condition, or MRI contraindications 
(e.g., non-removable metallic implants, claustrophobia). 

Inclusion in the PAE group required a documentation of heavy ma- 
ternal alcohol consumption during pregnancy, defined as 4 or more 
alcoholic drinks per single occasion at least once per week, or 14 or 
more drinks per week on average. Whenever possible, exposure was 
confirmed using medical history, birth records, social services records, 
or maternal report. However, direct maternal report was not common 
as many of these children no longer resided with their biological fami- 
lies. Thus, precise details about alcohol consumption (i.e., dose and 
timing) were often unavailable. In these cases, mothers were reported 
to be "alcoholic" or alcohol abusing or dependent in pregnancy. In addi- 
tion to psychometric testing and questionnaire screening, a diagnosis of 
FAS was determined by Dr. Kenneth Lyons Jones, a dysmorphologist, 
using a standardized assessment (Jones et al, 2006; Mattson et al., 
2010). FAS was diagnosed in the presence of structural abnormality 
(i.e., two or more of the following facial features: short palpebral fissure 
length, smooth philtrum, thin vermillion border) and either growth de- 
ficiency (height or weight < 1 0%) or microcephaly (occipital-frontal cir- 
cumference < 10%). Based on this screening process, 6 alcohol-exposed 
children received a diagnosis of FAS. Children in the NC group were ex- 
cluded if greater than minimal prenatal alcohol exposure was reported 
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(i.e., < 1 absolute alcoholic drink/day prior to pregnancy recognition). As 
the majority of children in this group reside with their biological 
mothers, teratogenic exposure was determined through direct maternal 
report. Children in the NC group were screened for prenatal alcohol 
exposure through a telephone interview process and completion of a 
parent questionnaire that elicited information concerning alcohol con- 
sumption during pregnancy. 

Informed consent and assent were obtained from the legal guardians 
and participants in accordance with the Institutional Review Boards of 
San Diego State University (SDSU) and the University of California San 
Diego (UCSD). Following informed consent all children were scanned at 
the UCSD Keck Center for Functional MRI on a 3T General Electric Signa 
Excite using an 8 channel head coil. Tl -weighted 3D images were ac- 
quired for 17 subjects with the following parameters: TR, 8 ms; TE, 
3.0 ms; flip angle, 12°; matrix, 256 x 256 x 192; FOV, 240 x 240 mm; 
total acquisition time, 7 min and 4 s. For the remaining five subjects, 
the following scanning parameters were used: TR, 8 ms; TE, 3.1 ms, flip 
angle, 8°; matrix, 256 x 256 x 192, FOV 256 x 256 mm; acquisition 
time, 7 min and 24 s. Final reconstructed voxel size for all images was 
lxlxl mm. 

2.3. Statistics 

2.3.1. CATK algorithm validation 

Intraclass correlation coefficients (ICCs) were used to measure the 
consistency or reproducibility of the manual cerebellar measurements 
compared to the CATK generated automated cerebellar measurement. 
ICCs were computed on the manual labelings of the 20 subjects with 
two separate scan sessions to assess the test-retest reliability of manual 
delineation (ICCs above 0.8 are considered excellent), and also on the 
CATK applied to the two separate scan session data to determine 
CATK test-retest reliability. ICCs were also computed over all four mea- 
surements (two manual, two CATK) on each cerebellar structure (total 
cerebellum, right and left hemispheres, three vermal lobes), to assess 
the agreement between manual and automated measures. We also 
computed the mean Dice Coefficient for each cerebellar measure for 
the two scan sessions of the same 20 subjects used to compute ICCs. 
The Dice Coefficient is a spatial overlap measure between CATK and 
gold-standard labeling, with 100% being perfect (Crum et al., 2005). 
Lastly, we computed the RMS mesh error (0 is perfect) between the 
CATK contours and manual contours. 

2.3.2. Clinical application 

ANOVA was used to compare the PAE and NC groups on measures of 
total cerebellar volume, left and right cerebellar hemisphere volumes, 
and mid-sagittal vermis lobe areas (anterior, superior posterior, posteri- 
or inferior). Microcephaly is a prominent feature of prenatal alcohol ex- 
posure. As a result, we hypothesized that all cerebellar size measures 
would be related to overall head size, but might be different between 
groups. We used the cranium size index calculated by FSL's SIENAX 
(Smith et al., 2002) for our measure of head size, as we have previously 
shown it to be an excellent surrogate for intracranial vault volume (Fein 
et al., 2004). Pearson correlations were used to assess the relationships 
between cranium size and cerebellar size in each group. Linear model- 
ing was then used to determine if group differences in cerebellar size 
measures could be completely explained by head size and vice-versa. 

3. Results 

3.1. CATK algorithm validation 

The test-retest ICCs for CATK were greater than 0.94 for all cerebellar 
volume and mid-sagittal vermal area measures, comparable or better 
than the test-retest ICCs for manual measurement (all ICCs > 0.92). 
The ICCs computed on all four cerebellar measurements (manual and 
CATK measures on the repeat scans) to compare comparability between 



measurements were above 0.97 for non-vermis parcels, while vermal 
mid-sagittal area ICCs were 0.90 for vermis I-V, 0.89 for vermis VI-VII, 
and 0.91 for vermis VIII-X. We obtained mean Dice overlaps of 90% for 
the full cerebellum and 91% for the hemispheres, and 72-79% for the 
vermal lobe volumes. The somewhat lower Dice Coefficients for the 
vermal lobes are attributed to their much smaller overall volume and 
difficulty estimating the lateral border. The mean Dice overlaps for the 
vermis mid-sagittal areas are much improved, ranging from 82 to 88%. 
The RMS mesh error (0 is perfect), which is not affected by overall vol- 
ume, between the CATK contours and manual contours ranged from 1.2 
to 1.8 voxels for all regions. Fig. 2 shows the excellent agreement 
between manual delineation and CATK automated delineation for 
total cerebellum volume and total vermis area, and Fig. 3 shows a 
side-by-side comparison of CATK and manual marking for a representa- 
tive subject. These results demonstrate that CATK is an appropriate and 
validated method for cerebellar segmentation that has comparable 
accuracy to hand delineation. 

Fig. 4 shows three examples demonstrating CATK's fitting of 3D par- 
cels to participants from this study. Good cerebellar panellation is 
achieved on the adolescents, despite the majority of the training sam- 
ples being adult or elderly subjects. In particular, CATK leads to good es- 
timates of the cerebellum and vermis, even when the scans are of poor 
quality (Fig. 4, bottom panel). 

3.2. Clinical application to FASD 

Thirteen PAE children and nine NC children had both complete 
motor assessments and structural imaging, and were included in the 
study. Two children with PAE had cerebellar herniation, an anatomical 
anomaly that was not well-delineated by CATK, and were excluded 
from further analysis of cerebellar measures. The remaining sample de- 
mographics were: 11 PAE (3 girls), 13.2 ± 2.4 years; 9 NC (3 girls), 
12.8 ± 2.1 years. 

When comparing the 1 1 PAE to 9 NC, the PAE group had smaller 
mean total cerebellum (10% smaller volume, p = 0.07), left cerebellar 
hemisphere volume (11% smaller, p = 0.05), and right cerebellar hemi- 
sphere volume (10% smaller, p = 0.07). The PAE group also had signif- 
icantly smaller vermal mid-sagittal areas for the anterior lobe (19% 
smaller, p = 0.006) and the posterior inferior lobe (16% smaller, 
p = 0.002). Table 1 shows cerebellar volumes and mid-sagittal areas 
for the PAE and NC groups. 

Head size was significantly smaller in the PAE group compared to 
the NC group (13% smaller, p = 0.01). Pearson correlations assessed 
the relationship between head size and cerebellar measures in each 
group separately. Within the PAE group, head size was highly correlated 
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Fig. 2. Comparison of manual delineation and automated delineation using the CATK on 20 
subjects who were not used in training, for total cerebellar volume and total cerebellar 
vermis area. 
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Fig. 3. Illustration of automated delineation using CATK (left panel) and manual delineation (right panel) on a representative subject who was not used in training, for left and right cer- 
ebellar hemispheres and all vermal lobes. Top panel shows an overlay onto the Tl -weighted image, while the bottom panel shows each parcel. Green: left cerebellum, red: right cerebel- 
lum, blue: vermis anterior lobe (lobules I-V), cyan: vermis superior posterior lobe (lobules VI-VII), magenta: vermis inferior posterior lobe (lobules VIII-X). (For interpretation of the 
references to colors in this figure legend, the reader is referred to the web version of this article.) 



with all cerebellar size measures (r = 0.56-0.91, p < 0.001-0.08) ex- 
cept the mid-sagittal areas of the superior posterior lobe (r = 0.40, 
p = 0.22) and inferior posterior lobe (r = 0.50, p = 0.12). Within 
the NC group, head size was not significantly correlated with any cere- 
bellar size measure (all p > 0.31) except the inferior posterior lobe 
(r = -0.70, p = 0.04). 




Fig. 4. 3D panellation of: vermis anterior lobe (lobules I-V) blue; vermis superior posterior 
lobe (lobules VI-VII) magenta; vermis inferior posterior lobe (lobules VIII-X) red; hemi- 
spheres, yellow and green. The top and bottom panels show delineation in patients with 
histories of prenatal exposure to alcohol, the middle panel is from an age-matched control. 
The bottom example shows a good fit even for a scan of poor quality. (For interpretation of 
the references to colors in this figure legend, the reader is referred to the web version of 
this article.) 



Linear modeling showed that differences in head size between 
groups disappeared when covarying for any cerebellar size measure. 
Linear modeling showed that the significance of the group differences 
in cerebellar volume measures (total cerebellum, left and right hemi- 
spheres) and mid-sagittal vermal areas (anterior and posterior inferior) 
remained or were enhanced after covarying for head size, as shown by 
the last column in Table 1. 

4. Discussion 

We present a novel, automated method for segmentation of the total 
cerebellum, left and right cerebellar hemispheres, and three lobes of the 
cerebellar vermis (lobules I-V, VI-VII, and VIII-X). We trained our 
method using 43 subjects ranging in age from 5 to 96, and showed im- 
pressive results when tested on a group of subjects ages 19-34 (who 
were not part of the training set). These results demonstrate excellent 
reliability and validity of CATK volume and mid-sagittal area measure- 
ments, compared to manual measurements. Our results also demon- 
strate excellent validity of CATK voxel classifications. 

CATK worked without modification on all subjects reported in this 
paper. Like any automated segmentation method, however, it is not 
foolproof. In our recent experience applying the method to more than 
300 scans of adults and adolescents, the method failed on less than 5% 
of images. Generally the errors were due to preprocessing failures of 
head orientation correction and cerebellum isolation. The major cause 
appeared to be due to errors in the original data where incorrect orien- 
tation labels were assigned in the image headers. Both types of errors 
result in obvious failures easily detected by the user. In some of our 
test cases used for validation, small regions of cerebral cortex are includ- 
ed in our cerebellar segmentation; despite this our validity measures 
were excellent. These errors in segmentation may limit the ease to 
which CATK can be integrated with whole brain analyses. 

ANOVA analyses detected smaller cerebellar volumes and mid- 
sagittal areas, automatically measured using CATK, in PAE compared 
to controls. These results are in agreement with prior reports of smaller 
cerebellum in FAS and with previous studies that observed smaller mid- 
sagittal areas of the anterior lobe and posterior inferior lobes measured 
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Table 1 

Cerebellar size measures for participants with prenatal alcohol exposure (PAE, n = 11) and normal controls (NC, n = 9). Bold values indicate p < 0.10. 



Region 


PAE 


NC 


Group 
p-value 


Group p-value when covarying 
for cranium size * group 


Total volume 


125080 ± 18777 


139251 ± 13048 


0.072 


0.051 


L hemisphere volume 


56533 ± 8158 


63380 ± 6019 


0.051 


0.023 


R hemisphere volume 


57124 ± 8650 


63823 ± 6088 


0.066 


0.022 


Anterior vermis (I-V) area 


355 ± 69 


436 ± 49 


0.006 


0.038 


Superior posterior vermis (VI-VII) area 


223 ± 41 


215 ± 24 


0.631 


0.901 


Inferior posterior vermis (VIII-X) area 


280 ± 37 


334 ± 27 


0.002 


0.007 



manually (O'Hare et al, 2005; Sowell et al, 1996). As with previous re- 
ports, we did not detect smaller mid-sagittal area in the superior inferior 
lobe (lobules VI-VII). The consistency of our results with manual tracing 
in a disease group adds to the validity of our automated cerebellar delin- 
eation method. 

Compared to normal controls, we also found that children prenatally 
exposed to alcohol showed a smaller overall head size. Most cerebellar 
size measures were related to overall head size within the PAE group 
but not the NC group, such that group differences in overall head size 
were explained by cerebellar size measures. However, group differences 
in the cerebellar measures were not completely explained by head size, 
as shown by the last column of Table 1. These results suggest that 
1 ) PAE-related microcephaly is strongly related to cerebellar hemi- 
spheric volumes, and 2) smaller cerebellar measures in FASD are not 
fully explained by microcephaly, and suggest an additional direct effect 
of prenatal alcohol exposure on the cerebellum. 

A limitation of the current study is that we did not measure alcohol 
consumption in the adolescent sample studied. Given this, we cannot 
rule out the possibility that alcohol drinking in the PAE sample might 
be associated with the smaller cerebellar measurements in that group 
vs. controls (Lisdahl et al., 2013). However, the mean age of the partic- 
ipants was only 13 years, so it is possible that current drinking, if any, 
was minimal among all the children since only a small percentage of 
children have begun regular drinking before age 14 (Hingson et al., 
2006). This analysis was also limited by the sample size, underscoring 
the need to replicate our results in a larger sample with a more inclusive 
age range than studied here. Even so, the findings of smaller head size, 
cerebellum volume, and mid-sagittal vermis areas were remarkably 
consistent with prior studies of independent samples, suggesting that 
our subjects were a representative sample of this population. 

In conclusion, these data illustrate that our new technology for auto- 
matically delineating the cerebellum found specific volumetric reduc- 
tions in a FASD sample that are consistent with prior studies that used 
labor intensive manual delineation. Further work will use CATK to de- 
lineate the cerebellum on a larger sample, and associate cerebellar size 
measures with measures of motor and cognitive function, to determine 
the effect of alcohol-related cerebellar damage on function. 
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